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Indian Standard 

ANALYSIS OF VARIANCE 
0. FOREWORD 

0.1 This Indian Standard was adopted by the Indian Standards Institution 
on 10 February 1975, after the draft finalized by the Quality Control and 
Industrial Statistics Sectional Committee had been approved by the 
Executive Committee, 

0.2 The technique of analysis of variance is an indispensable tool in the 
scientific and industrial research for the analysis of experimental data in- 
volving quantitative measurements and is particularly helpful when several 
independent sources of variation are present in the data such as the results 
obtained at different temperatures, duplicate determinations of the same 
material made by several analysts, measurements classified according to 
several sources of supply of raw material obtained from different vendors, etc. 

0.3 It is well-known that the observations obtained by repetitive experi- 
ments vary among themselves. The source of variation in the data may be 
due to various causes, assignable or chance. Using the analysis of variance 
techniques it is possible to estimate how much of the total variation in a set 
of data can be attributed to one or more assignable causes of variation, the 
remainder which is not attributable to any assignable causes of variation 
being classed as due to chance causes which produces the residual or error 
variation. 

0.4 This standard is a sequel to the 'Indian Standard on Statistical Tests 
of Significance' (IS : 6200-1971). To compare the means of two groups of 
observations and to assess whether the difference between them can be 
reasonably ascribed to chance, the t-test is used. When the comparison 
is to be made among the means of more than two groups of observations, 
resort to the analysis of variance technique is made. In fact, i-test is a 
particular case of analysis of variance. 

0.5 In reporting the result of a test or analysis, if the final value, observed 
or calculated, is to be rounded off, it shall be done in accordance with 
IS : 2-1960*. 



1. 15COPE 

1.1 This standard intends to give a brief outline of the general treatment 
of the analysis of variance technique with respect to some of the designs 

♦Rules for rounding off numerical values (revised). 
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which are more frequently used in industrial experimentation. The 
techniques have been illustrated with examples wherein the necessary 
computational details have also been given. 

2. TERMINOLOGY 

2.0 For the purpose of this standard, the following definitions shall apply. 

2.1 Standard Deviation — The square root of the quotient obtained by 
dividing the sum of squares of deviations of the observations from their mean 
by one less than the number of observations in the sample. 

2.2 Variance — Square of standard deviation. 

2.3 Degrees of Freedom (D. F.) — The number of independent com- 
ponent values which are necessary to determine a statistic. 

2.4 Nnll Hypothesis — The hypothesis (or assumption) of the equivalence 
(or no difference) among the effects of methods so that the samples emanate 
from the same lot. 

2.5 Level of Significance — The probability (or risk) of rejecting the null 
hypothesis when it is true. Conventionally, it is taken to be 5 percent or 
1 percent. 

2.6 Corrected Sum of Squares — The total of the squares of the devia- 
tions of the observations from their mean. 



3.1 Mathematical Model 

3*1*1 Before the application of the analysis of variance techniques to any 
experimental data, it is fundamental to have some knowledge of the mathe- 
matical model holding good for the particular investigation under 
consideration. Basically there are three jmodels, namely, fixed effects model, 
random effects model and mixed effects model {see also Appendix A) . 

3.1,2 In the fixed effects model, some of the assignable causes of variation 
in the experiment are deliberately chosen so that the results of the analysis 
are not amenable for generalization in that direction. In other words, when 
the effects are unknown constants (parameter) the model is called fixed 
effects model or model I. For example, in an inter-laboratory investigation 
on the checking of the precision of tensile testing machine, four laboratories 
possessing a particular brand of the machine may be intentionally chosen 
so that the results of the analysis will apply only to these four laboratories 
and any conclusion derived will not be applicable to all the laboratories in 
general. On the other hand, if the tensile testing machines of a large number 
of laboratories are to be investigated and due to limitations of facilities, the 
four laboratories chosen are a random sample of all the laboratories, then 
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the jnathematical model chosen, is that of a random effects model or model II. 
The mixed effects model, as the name itself indicates, is the combination of 
the earlier two models where some of the effects are of the fixed nature, the 
remaining being of random nature. The random effects model has exten- 
sive applications in most of the industrial experimentation. 

3.2 Additive Nature of Sum of Squares 

3.2.1 As the name implies the technique of analysis of variance consists 
in separating the total variance into parts, each part measuring the variabi- 
lity attributable to some specific source. For example in an inter-laboratory 
testing there may be variability among laboratories and variability within 
the laboratory. The latter, in turn, may be composed of a number of com- 
ponents of differing magnitude and importance like the variability among 
different analysts, among days, or among determinations made on the same 
day. Using the additive property of the variance the total variation between 
members of a set of observations, classified according to one or more criteria, 
can be broken up into components, attributable to different criteria of classi- 
fication which are of experimental interest or importance. By testing the 
significance of these components it is possible to determine which of the 
criteria are associated with a significant proportion of the overall variability 
in the averages. 

3.2.2 Planned experiments have proved to be very powerful and econo- 
mical in investigating the influence of various factors contributing to the 
total variance in the measurable characteristics of the product. In these, 
measurements are taken on a sample of units, which is so constituted as to 
ensure the simultaneous randomization in experimental error and variation 
due to changes in treatments. As long as they are clearly defined and 
reproducible these treatments may represent different materials or tempera- 
tures or processes or any variation in operating conditions. 

3.3 Orthogonality of Designs 

3.3.1 For the easy statistical analysis of the data resulting from any 
investigation, a desirable feature of the design adopted is its orthogonality. 
Orthogonality ensures that the different classes of effects shall be capable 
of direct and separate estimation without any entanglement. 

3.3.2 For example, in an experiment to observe the effect of varying two 
factors, say temperature and pressure, wherein the temperature is measured 
at four levels and pressure at three levels, there wilt be 12 experimental con- 
ditions generated by taking each level of temperature with each level of 
pressure. The comparison between the average of three results at any two 
temperatures will then be purely a measure of the effect of temperature (and 
vice versa) since the averages will have been taken over the same set of pres- 
sures. The two factors are said to be mutually orthogonal. 

Now by accident or design suppose that the combination corresponding 
to the highest temperature and pressure is missing from the experiment. 
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then die difference between the averages of the observations at the lowest 
and the highest temperatures could not be purely a measure of the effect of 
temperature because the first average is based on observations at three levels 
of pressure whereas the second observation is based on the average at only- 
two levels of pressure and hence the difference between the two averages will 
be, to an unknown extent, influenced by the effect of pressure. The experi- 
ment in this case is non-orthogonal. 

3.4 The details of the application of the technique of analysis of variance 
will vary with the number of independent causes of variation. It is possible 
to classiiy the data with respect to each independent source of variation and 
the complete classificadon is a necessary first step to the application of 
analysis of variance. The following sections describe the procedure for 
analysis of variance under different categories which are commonly met in 
practice. 

4. ANALYSIS OF VARIANGE -^ SD^GLE FACTOR OR ONE-WAY 
GLASSIFIGATION OF DATA 

4.1 Replicate Determinations 

4.1.1 Data will frequently be encountered where classification is based on 
one factor only, for example analysts or temperatures or batches of material. 
The data may consist of (a) replicate determinations of the same material 
made by several analysts in the same laboratory or (b) measurements ob- 
tained at different temperatures or (c) measurable characteristics of some 
material obtained in different lots or shipment from the same supplier, etc. 

4.1.2 In all these, there will be variation within replicates (unassignable 
variation in the system) which is a measure of the precision. There will 
also be a variation in means of results obtained under different conditions. 
It is due to the differences among analysts or differences in temperatures or 
lot-to-lot variation of the shipment. Analysis of variance helps to separate 
these effects and to determine whether there is any significant difference 
between operators or temperatures or shipments, as the case may be. 

4.1.3 Usually, in the interest of experimental efficiency and simplicity of 
analysis, it is desirable to have the same number of replicate determinations 
for each class. But owin" to a lack of desi*5Ti or to the loss of T»art of data 
or natural grouping of experimental material or perhaps due to deliberate 
placing of emphasis on certain effects, the number of observations in various 
classes may be unequal. The following two examples illustrate the method 
of analysis when there are (a) unequal number of observations in each class, 
and (b) equal number of observations in each class. 

4.1.4 Example 1 — The following data represent the warpway breaking 
strength of Type II Indian hessian [see IS : 2818 (Part II)- 1971*] measured 

•Specification for Indian hessian : Part H 305 and 229 g/m' at 16 percent contract regain 
(Jerst revision)* 
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in units of kg. Products of three different mills A, B and G were tested and 
the test results are reproduced below : 

Mill Warpwqy Breaking Strength No, of Total Mem 

(kg) Observations 

A 87.96,99,94,91 5 467 93-4 

B 93, 103, 90, 93, 99, 88, 100, 91 8 757 94-6 

G 98,88,84 3 270 90-0 

Total 16 1494 

The various calculations needed for forming the required analysis of 
variance are as follows : 

a) Uncorrected total sum of squares : (87)2+ (96)2+ (84)^= 139 940 

KN TT .A f K *, Ml (467)2 (757)« (270)* 

b) Uncorrected sum of squares between mills : ^^ — g-^+- x -- — h ;; 

J o o 



(1 494)s 



= 139 548-93 



c) Correction factor : i — ^^-^-=139 502 25 

d ) Total sum of squares : (a) — (c) = 43 7 • 75 

e) Sum of squares between mills : (b) — (c)= 46-68 

f) Sum of squares within mills : (d) — (e}=39107 

The degrees of freedom corresponding to the various entries of the 
analysis of variance table are obtained as follows : 

Degrees of freedom for total sum of squares = Total number of 

observations — 1 = 16 — 1 
= 15 
Degrees of freedom for between mills = Total number of mills— 1 

=3-1=2 
Degrees of freedom for within mills = Total degrees of freedom — 

degrees of freedom for between 
mills=15-2=13 





Analysis 


OF Variance Table 




Source of 
Variation 


Degrees of 
Freedom 


Sum of 
Squares 


Mean 
Square 


F Ratio 


Between mills 
Within mills 


2 
13 


46-68 
39107 


23-34 
30-08 


23-3^ 0.78 
36-08 ^^^ 


Total 


15 


437-75 







To test the hypothesis that there is no appreciable variation between the 
^,.^-«„^ 1^ „u;„- „4. „„*T — f *i ^j,,„*„ ^f +i,„ j;ir * :ii„ *t,« ..„_;„ 

avt^iagv Liiuaiviii^ aLi(^ij,gtii y/i Livv; uiuiaut^is ui iiiu laiiii^i i^itL iiuiia i.ii«j vaixaixv^c 

23-34 
ratio is formed as F= - v ^ =0-78. Since this is less than 3'81 which is 

the tabulated value of F for 2 and 13 degrees of freedom at 5 percent level, 
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there is insufficient evidence to reject the hypothesis. Therefore, it may 
be concluded that there is no appreciable variation in the breaking strength 
of the products of the three mills. 

4.1.5 Example 2 — The following data are from 5 replicate runs on the 
time of passage of aluminium powder through 6 different test units. It is 
endeavoured to test whether there is any significant difference between the 
6 test units when compared with the variation within units : 



Run 



Unit- 



52-9 


540 


52-6 


50-5 


54-6 


54-0 


52-3 


53-8 


53-2 


50-8 


54-6 


53-3 


52-2 


53-8 


53-4 


50-7 


54-4 


53-7 


52-5 


53-6 


53* 


50-8 


54-4 


53-5 


52-7 


53-6 


530 


50-5 


54-4 


53-7 



The data can be coded by subtracting 50 from each value and multiplying 
the remainder by 10 to remove the decimal. Coding by adding or sub- 
tracting a constant has no effect on the calculation of the corrected sum of 
squares or the variance. Coding by multiplying by a constant will change 
the resulting variance by the square of this constant. However, since the 
mean squares are tested by a ratio of two calculations, the coding factor will 
cancel. If ultimately it is intended to use the calculated variances, they 
will have to be decoded by dividing by 100. 

The coded data and the analysis of variance are as follows : 



Run 



Unit- 



1 


29 


40 


26 


5 


46 


40 


2 


23 


38 


32 


8 


46 


33 


3 


22 


38 


34 


7 


44 


37 


4 


25 


36 


34 


8 


44 


35 


5 


27 


36 


30 


5 


44 


37 



Total 


126 


188 


156 


33 


224 


182 


909 


Mean 


25-2 


37-6 


31-2 


6-6 


44-8 


36-4 





+ (35)2+(37)2 



The various calculations are obtained as below ; 
a) Uncorrected total sum of squares : (29) ^+(23)^4- 

=32 119 
, b) Uncorrected sum of squares between units : ^ [(126)2 4-(188)^+ .... 

(182)2] = 31 989 
('909') 2 

c) Correction factor: ^^--r^=27 542-7 

Hence the corrected sum of squares are obtained as : 

d) Total sum of squares : (a)-(c)=32 119-27 5427=4 576-3 

e) Sum of squares between units : (b) — (c)=31 989—27542-7=4446-3 
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f) Sum of squares within units : (d)—(e)=4 576-3— 4446-3 = 130-0 
The degrees of freedom for the various entries of the analysis of variance 
table are obtained as follows : 



Total sum of squares 
Between units 
Within units 



30-1=29 

6-1= 5 

29-5=24 





Analysis of Variance Table 




Source of 
Variation 


D.F. 


Sum of 
Squares 


Mean 
Square 


F Ratio 


Between units 
Within units 


5 
24 


4 446-3 
130-0 


889-26 
5-42 


-- = ,«.... 


Total 


29 


4 576-3 







**Highly significant. 

Tabulated F value for 5 and 24 degrees of freedom at 01 level is only 3-90. 
The F value as obtained from the data is highly significant. Hence there is a 
strong evidence of a factor between units which causes a variation in the 
results greater than that which can be accounted for by variation within 
units. 

It may now be desirable to compare all possible pairs of means to find out 
which of the means or how many of them differ significantly from the others 
to cause the overall variation among means to be significant. For this 
purpose use is made of ^test for difference between means, taken two at a 

time. With n means, there are ■ —^ comparisons (number of combina- 
tions of n things taken two at a time) . In the present example there will be 

6x5 ,. 

—^ = 15 comparisons. 

The residual variance gives a measure of the precision of a single 
measurement. This measure is identical with the pooled estimate 
of the standard deviation. In the present example it is obtained as 
V5 -42— 2-327. Standard deviation of the mean of five measurements is 

-%- times the standard deviation of single measurement and is equal to 

V 5 
2-327 

V 5 

The critical diflference, useful for finding out whether any two means are 
significantly different or not, is obtained as t^x V2x 1 040 6 where t^^ is the 
tabulated value of the t distribution for 24 degree of freedom at the 5 percent 
level, which is obtained as 2064. The critical difference is thus obtained 

9 
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as 3 04. It may, however, be noted that the use of critical difference is 
valid only when the F ratio in the analysis of variance table is found to be 
significajit. After obtaining the critical difference the various means are 
arranged either in the ascending or the descending order of magnitude as 
follows : 



Unit No. 4 13 6 2 5 

Means 6-6 25-2 31-2 36-4 376 44-8 

Any two means not under scored by the same line are significantly 
different. Thus in the present example only the means between unit 6 and 
unit 2 are not different. Hence it may be concluded that all other units 
give results which are significantly different from one another. 

4.2 Single Glassificatioiii with Snbgrouping (Nesting) 

4.2.1 Often the principal classification in the data can be divided into 
sub-classifications which do not cut across the main classes. Each sample 
may be composed of sub-samples and these, in turn, may consist of sub-sub- 
samples. The repeated sampling and sub-sampling give rise to nested 
sampling or hierarchical classification. For example if we draw 1 bales of 
wool at random from a shipment and take three cores of wool from each 
bale, the cores are said to be nested within bales. Again several analysts 
might draw two, three or four specimens (not necessarily the same number 
for each analyst) from a batch of material and run several replicate analyses 
on each specimen. The replicate analyses give an estimate of the error 
variance. The specimens run by the same analyst give a measure of varia- 
tion within the batch plus the error. The differences between the results 
of the several analysts include not only the variation between analysts but 
also the variation between specimens and the error variation. Same situa- 
tion holds good if several test pieces are cut from each of several rolls of 
fabric made wholly from several different machines from each of several 
plants. As long as there is no relation between the corresponding members 
of the different groupings, the analysis of variance by subgroups (or nesting, 
as it is sometimes called), applies. Where there is relationship between the 
members of the subgroups, so that the corresponding division of each sub- 
group can be considered as a separate class, we have an analysis of variance 
for more than one main classification. The general arrangement of data 
for single classification with several hierarchies of subgrouping is illustrated 
by the following example. 

4.2.2 Example 3 — A series of trials is made by three operators to locate 
a source of variation in a chemical analysis. The procedure consists in 
taking a specimen, treating it in a combustion-tube furnace and performing 
the chemical analysis. In the test, three operators each took two specimens 
and made three combustion trials on ^ach specimen £ind titrated each trial 
in duplicate. The single letter A represents the operator factor, the double 

10 









Data for Example 


3 










A 




1 


2 


3 


B{A) 




11 


12 


21 


22 


31 


32 


C{AB) 




III 112 113 


121 122 123 


211 


212 213 


221 222 223 


311 312 


313 


321 


322 323 


Duplicate 
Insults 




/ 156 151 154 
1 154 154 160 


148 154 147 
150 157 149 


125 
125 


94 98 

95 102 


118 112 98 184 172 
124 117 110 184 186 


181 
191 


172 
176 


181 175 
184 177 


Sub Totals 


i) 


310 305 314 


298 311 296 


250 


189 200 


242 229 208 


368 358 


372 


348 


365 352 




") 


929 


905 


639 


679 


1098 


1065 




iii) 


1834 


1318 


2 163 



Grand Total 



5315 






to 
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letter B(A) indicates the first or second specimen taken by the operator, and 
the triple letter C{AB) indicates the combustion trial on each specimen 
by each operator. The results are shown in duplicate. 
Uncorrected sums of squares are obtained as follows : 

a) Total sum of squares: (156)2+{151)H • • • • +(184)2+(177)2 

=817 085 

b) Combustion within operators and specimens C{AB) : 

(310)^+(305)^+. . . . +(365)='+ (352)^ _ g^g ^^^ 

c) Specimen within operator B{A): 

(929)^+(9Q5)^+... + (1098)H(lQ65)^ _ 815 209 
6 



d) Between operator : 

(1834)^+(1318)^+(2 163)^ _g,^^^, 

Correction factor : ^^ }?^ = 784 700 

DO 

Analysis of Variance Table 



Source of Variation D.F, Sum of Squares Mean F 

Square Ratio 

Operators^ 2 814 937-784 700=30 237 15 168 15 168 ,,^^^ 

Specimen within operators 3 815 209-814 937== 272 91 

B(A) 

Combustion within operators 12 816 778-815 209=1569 131 131 _„ _,^ 

& specimens C(^fi) jy — 7'7 

Replicates, error 18 (By subtraction) 307 17 

Total 35 817 085 - 784 700= 32 385 

♦•Highly significant. 

From the analysis of variance table it may be concluded that the largest 
source of variation in results is between operators. There is no evidence of 
variation between specimens. There is a definite source of variation in 
combustion step in analysis. 

5. ANALYSIS OF VARIANCE - TWO-WAY CLASSIFICATION OF 
DATA 

5.1 If a series of experiments is run at different temperatures and different 
pressures or if material from several sources of supply is tested under a 
variety of conditions or if a group of operators makes a series of runs on a 

12 
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number of pilot plants, two-way classification of data is obtained. In such a 
case, one of the characteristics can be represented along the rows and the 
other along the columns. Further, each of the cells formed by the two-way 
classification can have either one or more than one observation. 

5.2 Two-Way Classification with One Observation in Each Cell 

5.2.1 The analysis of variance for this arrangement of data is similar to 
that for the single factor arrangement. The column-factor effect is cal- 
culated from the squares of column totals and the row factor effect, from the 
squares of row-totals. The following example illustrates the method of 
analysis. 

5.2.2 Example 4 — Six samples Of dextrose monohydrate were analysed 
in each of the seven laboratories for copper content (measured as ppm). 
The data obtained by the investigation is given below : 



'-..^^arr^lt 


1 


2 


3 


4 


5 


6 


Total 


iar^<-^ 
















1 


0-3 


0-2 


0-1 


0-7 


0-5 


0-4 


2-2 


2 


0-9 


0-9 


0-3 


0-3 


0-3 


0-8 


3-5 


3 


0-8 


1-9 


0-6 


0-4 


0-4 


1-1 


5-2 


4 


0-6 


0-6 


0-2 


0-3 


03 


0-3 


2-3 


5 


0-5 


0-2 


0-2 


0-2 


0-5 


0-5 


2-1 


6 


0-4 


0-4 


0-5 


0-4 


0-6 


0-6 


2-9 


7 


0-5 


0-6 


01 


10 


1-2 


1-2 


4-6 



Total 



40 



4-8 



2-0 



3-3 



3-8 



4-9 



22-B 



The uncorrected sum of squares are calculated as follows : 

a) Total sum of squares : (0-3)2+ (0-2)''+ +(l-2)2=17-56 



b) Between laboratories 
sum of squares 

c) Between samples 
sum of squares 

Correction factor 



(2-2)^ , (3-5)' 

6 
(40)2 



6 
(4-8)2 



+ 



7 ' 
(22-8)g 

42 '' 



7 
1238 



+ ^^U-90 

(4-9)2 
7 



+ 



13-20 



Analysis of variance table is formed as follows : 
Analysis of Variance Table 



Smrce of Variation 



D.F. 



Sum of Squares 



Mean 
Square 



F Ratio 



Between rows (laboratories) 


6 


13-90- 12-38= 1-52 


0-25 


Between columns (samples) 


5 


13-20- 12-38=0-82 


016 


Error 


30 


(by subtraction) 2-84 


009 



0-25 
0-.09 
016 
0-09 



2-7S 



= 1-78 



Total 



41 



17-56- 12-38=5- 18 
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From the analysis of variance table, it can be concluded that there is no 
significant difference between either the laboratories means or the sample 
means. 

5.3 Two-Way Classification vdth Multiple Observations in Each 
Cell — In this type of classification more than one observation is obtained 
in each cell formed by the rows in the columns, that is, repeated measure- 
ments are made on different randomly selected individuals. 

5.3.1 Replication facilitates fuller analysis of the data and the precision 
of the experiment increases with replication. 

5.3.2 Jn the two-way classification with more than one observation per 
cell, besides the variation due to row and column effects there would also 
be an interaction effect which is the result of different row column combina- 
tions. As all the different combinations from the groups of the two factors 
of classification play their part in the experiment, they also contribute to 
the total variability. This interaction becomes one of the sources of variabi- 
lity which must be taken into account in the analysis of variance. Supposing 
the criteria of classification are different varieties of material subjected to 
different treatments. In such a situation the same variety can be differently 
affected by different treatments and the same treatment can show different 
effects with different varieties. This combination effect is the interaction. 
The following example illustrates the method of analysis. 

5.3.3 Example 5 — For studying the accuracy of water meters for con- 
tinuous rate of flow, 5 water meters were tested by each of the two operators 
A and B. Each operator made five repeat observations on each meter. 
The observations in terms of the percentage accuracy of the meters are 
given below : 



Operator 






Water Meter Number 




Total 




1 


2 


3 


4 


1 
5 




A 




+ 0-5 


- 3 


- 2-5 


2 


- 3 








- 1 


- 2-5 


- 3-5 


2 


~ 3-5 








- 1 


- 3 


- 1-5 


2 


- 3 








- 1 


- 2 


- 2 


2 


- 3 






Cell total 


- 1 


- 3 


- 2-5 
-120 


2 


_ 3 






- 3-5 


-13-5 


100 



"15-5 


-345 


B 


+ 0-5 


- 3 


- 3 


- 3 








- 3 


- 2 


- 2 





- 3 








- 2 


- 2-5 


- 2 





- 4 








- 1 


- 30 


- 2 





- 3-5 






Cell total 


- 1-5 


- 2 


„ 1 


2 


- 3 






- 7-0 


-12-5 


-100 


20 


-16-5 


-44-0 




Total 


-10-5 


-260 


-220 


120 


"32-0 


-78-5 
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From the above table the various sums of squares are obtained as follows : 

a) Uncorrected total sum of squares=(0-5)2+(— l)*-f- (—3)^ 

=270-75 

Correction factor (CF) = ^"^^'^^^ =123-24 

Hence the corrected total sum of squares=270-75— 123-24= 147-51 

u\ T* . . f (-10-5)2 , (-32-0)2 _„ 
bj Between meters sum of squares = ^^ — —--Z, _|- ^ — ^ qy 

= 243-82-123-24 
= 120-58 

c) Between operators sum squares = -^^ — n^~ + ^ — 9=-^^ CF 

= 12505— 123-24=1 -Si 

d) Interaction sum of squares 

(-3-5)2 (16-5)2 

— f . . . .-^^ — ^ — GF— between meters sum of 



5 ' •■■• 5 
squares— between operators sum of squares 
- 25205-123-24-120-58-1-81 
= 6-42 

The analysis of variance table is then formed as follows : 

Source of Variation Degree of Sum of Mean F Ratio 

Freedom Squares Square 

30- 14 
Between meters (Af) 4 120-58 30-14 rhLi3= 18-84** 

1-60 
|,pi 
Between operator (O) 1 181 1-81 ris = 113 



4 

1 


120-58 
181 


30-14 
1-81 


1-60 
1-81 
1-60 


4 
40 


6-42 
IB- 70 


1-60 
0-47 


160 

0-47 



Interaction (Afx 0) 4 6-42 1-60 ^^^^ = 3-40* 

Error 

Total 49 147-51 

**Highly significant. 
♦Significant. 

When the interaction is tested against the error it is found to be highly 
significant thereby indicating the presence of interaction. It may hence be 
interpreted that different water meters behave differently with the change of 

i T„ ..: _r*u:^ £„ j: ^i i^ j. 3 r_„ ^i 1 3 

upciatui. ±11. view ux liiu luiuxug, uici'C is au UlgClXL UCCU lUI' LiXC piu^cuuxc 

for testing of the water meters to be standardized. Since the interaction 
is significant, under the random effects model chosen for the experiment, 
meters and operators are to be teste4 against the interaction. This testing 
reveals that between operators variation is not significant whereas between 
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meters variation is highly significant. The latter finding is perhaps to be 
expected, since the meters had originated from different manufacturers. 

5.4 Two-Way Classification with Sub-grouping (Nesting and 
Replication) 

5.4.1 Sub-grouping ot nesting of data within a main classification can 
occur with data that are collected under two main classifications. For 
example, if several factories are producing the same product on a batch 
basis and each makes several batches at two or more conditions then a situa- 
tion illustrated by nesting is obtained. 

5.4.2 The batches from each factory are not related to batches from other 
factories and for the purpose of analysis of variance they are simply sub- 
groups of the factory classification. Any variation between factories would 
include the variation between batches within factories. The condition 
factor is an independent classification and its effect on the variation of 
results is reflected in the difference between factories, that is to say, only in 
so far as there is interaction between the conditions and factory factors or 
the condition and batch factors. 

5.4.3 Sub-grouping can exist under either factor or both the factors in 
the same set of data. Secondary sub-grouping can exist within the first 
sub-groups. In fact, there can be any hierarchy of sub-groupings under 
both main classes of factors. An illustrative example is given wherein sub- 
grouping occurs under both the factors. 

5.4.4 Example 6 — For studying the effect of storage time and packing on 
the moisture content of corn flakes, two types of packings namely, polythene 
bags and polythene bags in cartons were chosen and the periods selected 
were 2, 4 and 6 months. Six different samples of corn flakes belonging to 
the same batch of manufacture were analysed in duplicate after storing 
them for 2, 4 and 6 months in the two types of packings. The resultant 
data is given below : 



Storage Period 




Packing Type 1 






Packing Type 


2 


2 months 


Sample 1 
6-17 
611 


Sample 2 
4-40 

4-72 


Sample 3 
4-22 
3-80 

802 


Sample 1 
4-66 
5-.00 


Sample 2 
3-98 
400 


Sample 3 
6-22 
5-40 


Cell Total 


12-28 


912 

4-46 
4-85 

9-31 


9-66 


7-98 


11-62 


4 months 


5-50 
5-48 


4-61 
4-25 

8-86 


4-52 
4-43 


456 
3-17 


5-88 
4-83 


Cell Total 


1098 


8-95 


7-73 


10-71 


6 months 


6- 10 
6-13 


5-80 
5-73 


6-79 
7-73 


6-89 
6-74 


7-12 
7-06 


6-55 
6-65 



Cell Total 



12-23 



11-53 



14-52 



13-63 



1418 



13-20 
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From the above data a sub- table of the following type is formed to assist 
in the computations. 

Sub-table 



Storage Period 


Packing 


Total 




Typel 




Type 2 




2 months 
4 months 
6 months 


29-42 
29-15 
38-28 




29-26 
27-39 
41-01 


58-68 

56-54 
79-29 


Total 


96-85 




97-66 


194-51 



The various sum of squares are then obtained as follows : 

a) Uncorrected total sum of squares=(617)2+ . .(6-65)2=1 095- 181 9 

fl94-51')2 
Correction factor (C¥)=i^~~~^=l 050 948 3 

Corrected total sum of squares^ 1 095181 9 — 1 050-948 3 =44233 6 

K^ n . V f (96-85)2 , (97-66)2 

b) Between packmg sum 01 squares =-^ — j-—^ — \- - — j-jr— ^ — CF 

= 1050 9666-1050-9483=0 0183 

N -D . -A r (58-68)2 , (56-54)2 (79-29)2 ^^ 

c) Between periods sum of squares =- — pr-^ — \- - — p— — \- ^ — r— — ^ CF 

= 1077-2515-1050-9483=26 3032 

d) Interaction (packing X period) sum of squares 

(29-42)2 , (41-01)2 

= fi~^"^ • " • "^ — fi — — ^'^ —between packmg sum of squares 

— between period sum of squares 
= 1078-1328-1050 9483-0 0183-26 3032 
=0 8630 

e) Between samples (within packing and period) sum of squares 
^ [" (12-28)2 (13-20)2 -1 _ [- (29-42)2 (41 01)2 -1 

L 2 "^■•*" 2 J L 6 ■^•••' 6 J 
= 1 092 521 4-1 078 132 8=14-388 6 

The analysis of variance table is then formed as follows : 
From the analysis of variance table, between sample sum of squares 
(within packing and period) is tested against error and the F ratio so obtained 
is highly significant. Because of this fact both the main effects due to pack- 
ing and period as also the interaction are tested against between sample 
(within packing and period). The testing reveals that the mean value of 
moisture content for between periods is highly significant, corroborating 
the general presumption that corn flakes gather moisture depending on the 
period of storage. The analysis also reveals that there is no significant 
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difference in the moisture content of corn flakes stored in two different 
packings. 

Analysis of Varunce Table 

Source of Variation Degree of Sum of Mean F Ratio 

Freedom Squares Square 

Between packing 1 0018 3 0018 3 Z ^ ^ 0-002 

tot e n c 

Between periods 2 26-303 2 13-1516 ,":^q ° = 10-97** 

1 ■ 1 99 

0-431 5 
Packing period 2 0-863 0-4315 ^'^ =0-36 

Between samples (within packing 12 14-388 6 1199 _lll2L9^8.ii«* 

and period) 0- 147 8 

Error 18 2-660 5 0-147 8 

(by 
subtraction) 

Total 35 44-233 6 

**Highly significant. 

5.5 Three-Way Classification — If there are three different sources of 
variation (factors of classification) which are Ukely to act independently of 
one ainother and contribute to the total variability of the data, then the 
total sum of squares has to be split up into three components due to these 
sources. The analysis of variance follows the same formulation as that of a 
two factor arrangement. When more than two factors are involved, all 
possible combinations of interactions may exist and a complete analysis of 
variance provides mean squares attributable to all the main factors and all 
individual interactions. 

5.6 Factorial Experiments — When the effect of several variables on a 
product or process is of interest, it is possible to devise experiments where 
all of them may be studied simultaneously. For each variable, a number of 
categories or levels may be chosen for study. If an equal number of obser- 
vations is made for all possible combinations of levels (one level from each 
variable), the experiment is called factorial. In a completely balanced 
experiment, each level of each factor is tested at all the levels of all the other 
factors so that the total number of observations required is the product of 
all the levels and all the factors. Thus in a factorial experiment to study the 
wear resistance of vulcanized rubber wherein five qualities of filler (factor 
A), three methods of pretreatment of the rubber (factor B) and four qualities 
of raw rubber (factor C) are involved, the total number of observations for a 
complete experiment turns out to be 5 x3x4=60. When more than two 
factors are involved, all the possible combinations of interactions may exist 
and a complete analysis of variance provides mean squares attributable to 
all individual interactions. 
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6. CLASSIFICATION OF HIGHER ORDER 



6.1 There is no limit to the number of main factors like different treatments, 
materials, laboratories, temperatures, pressures, catalysts or concentrations, 
that may be examined in the same experiment. A suitable design of ex- 
periment is a pre-requisite in such studies. 



APPENDIX A 

[Clause 3.1) 

FIXED, RANDOM AND MIXED MODELS 

If ATijk is the dimension of kih. component produced on the ixh machine 
onjth day in a plant, then it may be written that 

where 

(J, = overall mean dimension, 
flj = effect due to iih machine, 
6j = effect due to jth day, 

c\\ = effect due to interaction of iih machine and jth day, and 
^lik = random effects which are independently normally distri- 
buted with mean and variance cret- 
in the random effect model where the machines under study are con- 
sidered as a random sample from a large number of machine as also the days 
are the randomly chosen ones, «i, b\, and c^ are all assumed to be indepen- 
dently normally distributed with means and respective variances a\^, 
a^ and ctab*. 

In the fixed effects model where the conclusions are to be drawn only on 
the few machines that are under study and on the specific days chosen in 
the experiment, it is assumed that 

E ai=Z bi^i: Cii=Z cii=0 

i i i i 

In a mixed effect model where the machines are considered as a random 
sample from a large nuniber of machines, but the days are those specifically 
chosen, aj and cii are assumed to be independently normally distributed 
with mean and variances <ta^ and ctab^ and E bi=S Ci^—O. 

j j 
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